NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Low-Precision Streaming PCA

Dasgupta, Sanjoy; Kumar, Syamantak; Pandey, Shourya; Sarkar, Purnamrita (January 2026, Advances of Neural Information Processing Systems 2025.)

Free, publicly-accessible full text available January 31, 2027
A neural algorithm for computing bipartite matchings

https://doi.org/10.1073/pnas.2321032121

Dasgupta, Sanjoy; Meirovitch, Yaron; Zheng, Xingyu; Bush, Inle; Lichtman, Jeff W; Navlakha, Saket (September 2024, Proceedings of the National Academy of Sciences)

Finding optimal bipartite matchings—e.g., matching medical students to hospitals for residency, items to buyers in an auction, or papers to reviewers for peer review—is a fundamental combinatorial optimization problem. We found a distributed algorithm for computing matchings by studying the development of the neuromuscular circuit. The neuromuscular circuit can be viewed as a bipartite graph formed between motor neurons and muscle fibers. In newborn animals, neurons and fibers are densely connected, but after development, each fiber is typically matched (i.e., connected) to exactly one neuron. We cast this synaptic pruning process as a distributed matching (or assignment) algorithm, where motor neurons “compete” with each other to “win” muscle fibers. We show that this algorithm is simple to implement, theoretically sound, and effective in practice when evaluated on real-world bipartite matching problems. Thus, insights from the development of neural circuits can inform the design of algorithms for fundamental computational problems.
more » « less
Full Text Available
Reducing Catastrophic Forgetting With Associative Learning: A Lesson From Fruit Flies

https://doi.org/10.1162/neco_a_01615

Shen, Yang; Dasgupta, Sanjoy; Navlakha, Saket (October 2023, Neural Computation)

Abstract Catastrophic forgetting remains an outstanding challenge in continual learning. Recently, methods inspired by the brain, such as continual representation learning and memory replay, have been used to combat catastrophic forgetting. Associative learning (retaining associations between inputs and outputs, even after good representations are learned) plays an important function in the brain; however, its role in continual learning has not been carefully studied. Here, we identified a two-layer neural circuit in the fruit fly olfactory system that performs continual associative learning between odors and their associated valences. In the first layer, inputs (odors) are encoded using sparse, high-dimensional representations, which reduces memory interference by activating nonoverlapping populations of neurons for different odors. In the second layer, only the synapses between odor-activated neurons and the odor’s associated output neuron are modified during learning; the rest of the weights are frozen to prevent unrelated memories from being overwritten. We prove theoretically that these two perceptron-like layers help reduce catastrophic forgetting compared to the original perceptron algorithm, under continual learning. We then show empirically on benchmark data sets that this simple and lightweight architecture outperforms other popular neural-inspired algorithms when also using a two-layer feedforward architecture. Overall, fruit flies evolved an efficient continual associative learning algorithm, and circuit mechanisms from neuroscience can be translated to improve machine computation.
more » « less
Full Text Available
Data-Copying in Generative Models: A Formal Framework

Bhattacharjee, Robi; Dasgupta, Sanjoy; Chaudhuri, Kamalika (July 2023, Proceedings of Machine Learning Research)

There has been some recent interest in detecting and addressing memorization of training data by deep neural networks. A formal framework for memorization in generative models, called “data-copying” was proposed by Meehan et. al (2020). We build upon their work to show that their framework may fail to detect certain kinds of blatant memorization. Motivated by this and the theory of non-parametric methods, we provide an alternative definition of data-copying that applies more locally. We provide a method to detect data-copying, and provably show that it works with high probability when enough data is available. We also provide lower bounds that characterize the sample requirement for reliable detection.
more » « less
Full Text Available
Constants Matter: The Performance Gains of Active Learning

Mussmann, Stephen O; Dasgupta, Sanjoy (January 2023, Proceedings of the 39th International Conference on Machine Learning)

Full Text Available
A neural theory for counting memories

https://doi.org/10.1038/s41467-022-33577-2

Dasgupta, Sanjoy; Hattori, Daisuke; Navlakha, Saket (December 2022, Nature Communications)

Abstract Keeping track of the number of times different stimuli have been experienced is a critical computation for behavior. Here, we propose a theoretical two-layer neural circuit that stores counts of stimulus occurrence frequencies. This circuit implements a data structure, called a count sketch , that is commonly used in computer science to maintain item frequencies in streaming data. Our first model implements a count sketch using Hebbian synapses and outputs stimulus-specific frequencies. Our second model uses anti-Hebbian plasticity and only tracks frequencies within four count categories (“1-2-3-many”), which trades-off the number of categories that need to be distinguished with the potential ethological value of those categories. We show how both models can robustly track stimulus occurrence frequencies, thus expanding the traditional novelty-familiarity memory axis from binary to discrete with more than two possible values. Finally, we show that an implementation of the “1-2-3-many” count sketch exists in the insect mushroom body.
more » « less
Full Text Available
Framework for evaluating faithfulness of local explanations

Dasgupta, Sanjoy; Frost, Nave; Moshkovitz, Michal (January 2022, International Conference on Machine Learning)

We study the faithfulness of an explanation system to the underlying prediction model. We show that this can be captured by two properties, consistency and sufficiency, and introduce quantitative measures of the extent to which these hold. Interestingly, these measures depend on the test-time data distribution. For a variety of existing explanation systems, such as anchors, we analytically study these quantities. We also provide estimators and sample complexity bounds for empirically determining the faithfulness of black-box explanation systems. Finally, we experimentally validate the new properties and estimators.
more » « less
Full Text Available
Rethinking Logic Minimization for Tabular Machine Learning

https://doi.org/10.1109/TAI.2022.3224415

Qiao, Litao; Wang, Weijia; Dasgupta, Sanjoy; Lin, Bill (January 2022, IEEE Transactions on Artificial Intelligence)

Full Text Available
A Theoretical Perspective on Hyperdimensional Computing

https://doi.org/10.1613/jair.1.12664

Thomas, Anthony; Dasgupta, Sanjoy; Rosing, Tajana (September 2021, Journal of Artificial Intelligence Research)

Hyperdimensional (HD) computing is a set of neurally inspired methods for obtaining highdimensional, low-precision, distributed representations of data. These representations can be combined with simple, neurally plausible algorithms to effect a variety of information processing tasks. HD computing has recently garnered significant interest from the computer hardware community as an energy-efficient, low-latency, and noise-robust tool for solving learning problems. In this review, we present a unified treatment of the theoretical foundations of HD computing with a focus on the suitability of representations for learning.
more » « less
Full Text Available
Teaching a black-box learner

Dasgupta, Sanjoy; Hsu, Daniel; Poulis, Stefanos; Zhu, Xiaojin (January 2019, International Conference on Machine Learning)

Full Text Available

« Prev Next »

Search for: All records